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The 3C-like proteinase (3CL?"°) of the severe acute respiratory 
syndrome (SARS) coronavirus plays a vital role in virus maturation 
and is proposed to be a key target for drug design against SARS. 
Various in vitro studies revealed that only the dimer of the matured 
3CLP"® is active. However, as the internally encoded 3CL?*® gets 
matured from the replicase polyprotein by autolytic cleavage at 
both the N-terminal and the C-terminal flanking sites, it is unclear 
whether the polyprotein also needs to dimerize first for its auto- 
cleavage reaction. We constructed a large protein containing the 
cyan fluorescent protein (C), the N-terminal flanking substrate 
peptide of SARS 3CLP*”® (XX), SARS 3CL?"® (3CLP), and the yellow 
fluorescent protein (Y) to study the autoprocessing of 3CL?’° using 
fluorescence resonance energy transfer. In contrast to the matured 
3CLP"°, the polyprotein, as well as the one-step digested product, 
3CLP- Y-His, were shown to be monomeric in gel filtration and ana- 
lytic ultracentrifuge analysis. However, dimers can still be induced 
and detected when incubating these large proteins with a substrate 
analog compound in both chemical cross-linking experiments and 
analytic ultracentrifuge analysis. We also measured enzyme activ- 
ity under different enzyme concentrations and found a clear tend- 
ency of substrate-induced dimer formation. Based on these discov- 
eries, we conclude that substrate-induced dimerization is essential 
for the activity of SARS-3CL»” in the polyprotein, and a modified 
model for the 3CL?"° maturation process was proposed. As many 
viral proteases undergo a similar maturation process, this model 
might be generally applicable. 


Shortly after its outbreak in 2003, severe acute respiratory 
syndrome (SARS)? was confirmed to be caused by a new type of 
coronavirus, SARS-CoV. Similar to other coronaviruses, two- 
thirds of its genome encodes two large replicase polyproteins, 
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pp la (450 kDa) and pp lab (750 kDa), which will undergo exten- 
sive proteolytic processing mainly by the internally encoded 
main proteinase (also called 3C-like proteinase, 3CLP”°) to pro- 
duce multiple functional subunits that mediate both genome 
replication and transcription (1). 

The crystal structure of SARS wild type 3CLP*® is a 
homodimer (Protein Data Bank codes 1Q2W, 1UJ1, and 1UK2) 
and topologically similar to the other coronaviruses such as 
transmissible gastroenteritis coronavirus and human coronavi- 
rus 229E (2, 3) (Protein Data Bank codes 1LVO and 1P9U). One 
protomer of 3CL?"*® consists of three domains, of which the first 
two form a chymotrypsin fold, and is connected by a long loop 
with the third extra helix domain. The catalytic dyad, His-41 
and Cys-145, locates in the deep cleft between domain I and II. 
We have shown that the dimer is the biologically functional 
form of matured 3CLP”® (4), and only one monomer is active (5). 
Certain single site mutations, such as M6A, G11A, $39A, and 
R298A, result in inactive monomers (6 —9). 

Dimerization is a commonly used strategy in viral protease 
activity regulation (10). Many of the reported viral proteases are 
active only in dimer form (4, 11-14). For enzymes with an active 
site formed by residues from both the protomers, like HIV-1 
protease (15), it is straightforward to understand why the dimer 
formation is necessary. However, for proteases with a complete 
active site in one protomer, dimerization may be one method to 
regulate its activity. It is interesting to know what happens for 
these proteases before they have been cleaved out from the 
polyprotein as it might be difficult for a large flexible polypro- 
tein to form a stable dimer. A recent study revealed that the 
mini-precursor of HIV-1 protease formed a highly transient but 
low populated dimeric structure during maturation (16). 

For SARS 3CL?"°, much has been learned about its catalytic 
mechanism (17), substrate specificity (18), as well as inhibitor 
design (19, 20). However, few studies have been reported on its 
maturation mechanism. Shan et al. (21) introduced a 31-mer 
peptide containing an autocleavage site flanking to the N ter- 
minus of 3CL?"® to test the in cis activity of SARS 3CL?”®. They 
found that the peptide can be autocleaved efficiently by 3CLP*® 
itself by monitoring the autocleavage products on the gel. Hsu 
et al. (22) reported that the SARS 3CLP*® can be matured from 
polyprotein with flanking N- and C-terminal segments in vitro. 
The N terminus is digested faster than the C terminus, and 
during digestion, the 3CL?"° with 10 residues attached to both 
the N terminus and the C terminus can form dimer. However, 
10 residues flanking at the N terminus and C terminus might be 
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too short and may not reflect the real situation in the polypro- 
tein. To understand the maturation and activity regulation 
mechanism of SARS 3CLP*®, we constructed an artificial 
polyprotein containing cyan fluorescent protein (C), the 
N-terminal natural flanking substrate peptide of SARS 3CLP"®, 
SITSAVLQ (XX), SARS 3CLP*° (3CLP), and yellow fluorescent 
protein (Y) and used it to study the autoprocessing mechanism 
of SARS 3CLP"® polyprotein. In contrast to the dimerization of 
the matured enzyme, this polyprotein and its N-terminal 
cleaved product were found to be monomeric in conventional 
analysis but are still active. 


EXPERIMENTAL PROCEDURES 


Plasmid Construction of the Large Fusion Proteins His-C- 
XX(Q/E)-3CLP- Y, His-C-XX-C145A-Y, 3CLP-Y-His, and 3CLP- 
MBP-His—The large fusion protein constructs basically con- 
tained four components in sequence. The first component is C, 
and the second component is XX, which is the natural auto- 
cleavage substrate peptide of SARS-CoV 3CLP"® derived from 
the natural ppla/pplab polyprotein attached to its N terminus. 
Sequence analysis revealed 11 cleavage sites of SARS 3CLP*® on 
the SARS polyproteins, all of which contain a highly conserved 
substrate sequence (P1 Gln | P1’ Ser/Ala) (23). The N-terminal 
autolytic substrate peptide (SITSAVLQ | SGF) has the highest 
cleavage efficiency among all 11 substrate peptides (4). The 
third component consists of 3CLP or mutants, and the 
fourth component is Y, so the construct is named 
C-XX-3CLP-Y. 

To prepare His-C-XX(Q/E)-3CLP-Y and His-C-XX-C145A-Y, 
a cloning vector pET28a-CY2.0 with His, tag at the N terminus 
was built first (supplemental Fig. $1). Detailed experimental 
procedures can be found in the supplemental material. Here, 
Q/E represents the mutation of Gln in the extra 8-amino acid 
peptide (SITSAVLQ) to Glu. Mutation of the critical residue 
Gln at the P1 position to Glu completely abolishes the hydro- 
lytic activity of SARS 3CLP*® (18). 

3CLP-Y-His is the truncated protein of His-C-XX(Q/E)- 
3CLP-Y, which also had a His, tag at the C terminus for further 
protein purification (supplemental Fig. $1). 3CLP-MBP-His 
was a plasmid in which the YFP of 3CLP- Y-His was replaced by 
maltose-binding protein (MBP) (supplemental Fig. S1). 

All the plasmids were constructed following the methods 
listed in the supplemental material, and primers needed were 
listed in supplemental Table $1. All constructs were confirmed 
by DNA sequencing (Invitrogen). 

Fusion Protein Expression and Purification—The large poly- 
proteins with His, tags were expressed and purified with a 
slightly modified procedure when compared with the matured 
enzyme (4) (see detailed information in supplemental material). 
The method for wild type SARS-CoV 3CL proteinase expres- 
sion and purification was the same as reported before (17). 

Analytic Gel Filtration Analysis—The aggregation state of 
the large fusion proteins, His-C-XX(Q/E)-3CLP-Y and His- 
C-XX-C145A-Y, and the truncated one, 3CLP-Y-His, were 
analyzed using a Superdex 200 HR 10/300 GL column (GE 
Healthcare) on AKTA fast protein liquid chromatography. 
The purified proteins, at different concentrations, were 
loaded on the column, which was pre-equilibrated to 36 ml 
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of buffer A (40 mm phosphate-buffered saline, pH 7.3, 100 
mM NaCl, 1 mm EDTA). The column was eluted with another 
36 ml of buffer A at flow rate of 0.5 ml/min, during which the 
eluted volume of the fraction peak at 280 nm was monitored 
on fast protein liquid chromatography. The concentrations 
loaded onto the column were: 1) His-C-XX(Q/E)-3CLP-Y, 
4.4 and 10 mg/ml; 2) His-C-XX-C145A-Y, 4.4 and 10 mg/ml; 
and 3) 3CLP-Y-His, 6 mg/ml. Gel filtration molecular weight 
markers used and the standard calibration curve can be 
found in supplemental Table S2 and supplemental Fig. S3. 

Analytic Ultracentrifugation (AUC) Analysis—Sedimenta- 
tion velocity experiments were carried out using a Beckman 
Optima XLA analytical ultracentrifuge. The procedure of sed- 
imentation velocity was the same as reported previously (6). 
His-C-XX(Q/E)-3CLP-Y, His-C-XX-C145A-Y, and 3CLP-Y- 
His were prepared in buffer A. Samples (1 mg/ml, 380 pl) and 
reference (400 pl) were loaded into double-sector centerpieces. 
Data were analyzed with Sedfit version 11.71. 

In Vitro Trans-cleavage Activity Assay—7.5 uM His-C-XX(Q/ 
E)-3CLP-Y (or 3CLP- Y-His or 3CLP-MBP-His) as the enzyme 
and 22.5 uM His-C-XX-C145A-Y as the substrate were mixed 
and incubated in buffer A with 5 mo dithiothreitol at 37 °C with 
500 rpm of shaking for 2 h. The substrate His-C-XX-C145A-Y 
was cleaved at the substrate peptide Q | S bond to release two 
fragments, which eliminated FRET and resulted in a decrease at 
527 nm. To confirm the molecular weight of the two products, 
sample aliquots of 10 pl were taken out at different reaction 
times. For enzyme His-C-XX(Q/E)-3CLP-Y, samples were 
taken out at 0, 30, 60, 90, and 120 min and put on ice. In addi- 
tion, for 3CLP-Y-His, samples were taken out at 1, 2, 3, 4, 5, 6, 8, 
10, 15, 20, and 30 min. One part of the reaction mixture (5 pl) 
was diluted 40 times into ice-cold buffer A for FRET, and the 
other 5 pl was mixed with 5 pl of 2X SDS-loading buffer for 
SDS-PAGE on 10% polyacrylamide gel. Control assays were 
also performed by incubating 7.5 wm His-C-XX(Q/E)-3CLP-Y 
(or 3CLP-Y-His) and 22.5 wm His-C-XX-C145A-Y alone at the 
same conditions as the first test described above. 

Kinetic Measurement of Enzyme Activity—A colorimetric 
substrate, Thr-Ser-Ala-Val-Leu-Gln-pNA (GL Biochemistry 
Ltd.), was used for enzyme concentration-dependent kinetic 
measurement (17). The substrate was cleaved at the Gln-pNA 
bond to release free pNA, resulting in an increase of absorbance 
at 390 nm (measured using Synergy'™4, BIOTEK). Enzyme 
concentration ranges used here were 2.25—45 um for His-C- 
XX(Q/E)-3CLP-Y, 1.36—21.78 pum for 3CLP- Y-His, and 0.225- 
4.5 uM for SARS-3CL?"® in a 100-pl volume, respectively. The 
enzyme concentration dependence of rate constants was 
measured at 37 °C in buffer A with 5 mm dithiothreitol. The 
apparent second-order rate constant k,.,/K,,, was calculated 
by dividing the pseudo-first-order constant by the enzyme 
concentration (17, 24). We have derived a fitting model to 
obtain intrinsic k.,,/K,, (24). To be brief, the apparent 
enzyme activity was contributed from the dimers as well as 
the monomers in solution. 


(Kes Km) app = (Keg Ka rane meas! GE ss (Kat! Kn) dimerCdimer/Ce 


(orn, 
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in which (k,,,/K,,,)app is the apparent second-order rate con- 
stant, (k..4/K,,,) mono 2nd (Keat/K,,) dimer are that of the monomer 
and dimers, and C, is the total enzyme concentration. C 
and Caimer 

respectively. 


As the monomers can associate into dimers, we get 


mono 


are the concentration of monomer and dimer, 


M+M=M, K, = [M,|/[M}? (Eq. 2) 
C,=[M] + 2[M,| (Eq. 3) 
thus 
Cmono = (—1 + (1 + 8K,C,)"”?)/4K, (Eq. 4) 
Caimer = (4K,Cz + 1 — (1 + 8K,Cz)""*)/8K, (Eq. 5) 


in which M is monomer, M, is dimer, and K, is the monomer- 
dimer association constant. 

We have tried to fit Equation 1 and found that the activities of 
monomers were negligible for the matured 3CLP"®, as well as 
the polyproteins. This confirmed again that dimer is the active 
form. We then omitted the contribution from the monomers 
and fitted the data using Origin 8.0 (OriginLab) according to 
the following equation (24). 


ais = (Kat! Kin) dimer(4KaCe eh. (1 a 8K,Cz)"*)/8K.Ce 
(Eq. 6) 


The k..,/K,, of dimer and its association constant can be 
obtained from the fitting. 

Inhibition Efficiency of the Isatin Compound 5f on His-Y- 
XX(Q/E)-3CLP-Y—A known inhibitor, the isatin derivative, 
1-(2-naphthlmethyl) isatin-5-carboxamide (5f) (25), was dis- 
solved in DMSO to test the inhibition efficiency toward the 
polyprotein. To reduce the high background of FRET, another 
large fusion protein plasmid, His- Y-XX(Q/E)-3CLP- Y, was pre- 
pared as above, replacing the first cyan fluorescent protein by 
YFP, which was used as the enzyme for the inhibition test. 7.5 
uM. His- Y-XX(Q/E)-3CLP-Y and 50 um 5f (control was 5% 
DMSO) were preincubated in buffer A at 37 °C for 15 min, and 
then 22.5 wm His-C-XX-C145A-Y (substrate) was added to the 
mixture to react for 2 h. Samples were taken at different reac- 
tion times for FRET measurement. 

Chemical Cross-linking—Enzymes (6 mg/ml His-C-XX(Q/ 
E)-3CLP-Y, 6 mg/ml 3CLP-Y-His, and 5.12 mg/ml SARS- 
3CLP*°) alone or mixed with equal molar 5f were cross-linked 
by 10-fold molar excess ethylene glycol bis (succinimidyl succi- 
nate) in buffer A at room temperature for 30 min and then 
quenched by adding Tris (1 M, pH 7.5) to a final concentration 
of 50 mM. 


RESULTS 


Aggregation State of the Free Polyproteins—We investigated 
the aggregation states of the fusion proteins using gel filtration 
and analytic ultracentrifugation to see whether they can still 
form dimers in solution. To avoid autocleavage, two mutants, 
His-C-XX(Q/E)-3CLP-Y and His-C-XX-C145A-Y, were used 
for aggregation studies. The Gln to Glu mutation makes the 
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FIGURE 1. Sedimentation coefficient distribution c(s) profile of His-C- 
XX(Q/E)-3CLP-Y, His-C-XX-C145A-Y, and 3CLP-Y-His at 40,000 rpm and 
20 °C. 


substrate sequence XX uncleavable, whereas the Cys-145 to Ala 
mutation results an inactive enzyme. 

Analytic Gel Filtration Analysis—Both His-C-XX(Q/E)- 
3CLP-Y and His-C-XX-C145A-Y showed high FRET efficiency 
(supplemental Fig. S2), indicating that they were intact. A 
Superdex 200 HR 10/300 GL column was used to estimate their 
apparent molecular weights based on the retention volume 
at different protein concentrations (see details in the sup- 
plemental material). Both proteins were shown to be mono- 
meric even at high concentration of 10 mg/ml (supplemental 
Table S3 and supplemental Fig. S3). 

Analytic Ultracentrifugation Analysis—To further verify the 
ageregation state of the polyproteins, sedimentation velocity 
experiments were performed. The sedimentation experiments 
provide hydrodynamic information about the molecular size 
distribution and conformational changes in the native solution 
with no dilution effect when compared with gel filtration. Sed- 
imentation velocity was used to determine the sedimentation 
coefficient distribution c(s) and the molecular weight distribu- 
tion c(M) of polyproteins at 1 mg/ml. The resulting c(s) and 
c(M) distribution profiles showed that only one peak was 
detected (Fig. 1 and supplemental Fig. $4). Sedimentation coef- 
ficients of His-C-XX(Q/E)-3CLP-Y and His-C-XX-C145A-Y 
were ~4.9 and ~5.0 S, respectively. Based on the c(M) distri- 
bution model, the molecular masses of His-C-XX(Q/E)- 
3CLP-Y and His-C-XX-C145A-Y were estimated to be 82.7 and 
86.0 kDa (supplemental Table S4), indicating that both of them 
were monomeric. 

In Vitro Trans-cleavage Activity Assay of 3CL?’? in Poly- 
protein—The fusion polyproteins containing SARS 3CLP*° 
were shown to be monomeric in solution. To test its activity, 
trans-cleavage was monitored in vitro by FRET signal and con- 
firmed through SDS-PAGE. 
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FIGURE 2. The enzymatic activity of His-C-XX(Q/E)-3CLP-Y in vitro. A, FRET 
spectra at different reaction times. B, SDS-PAGE for the digestion at different 
times. From lanes 1-5, samples from 0-, 30-, 60-, 90-, and 120-min reaction 
times were loaded, respectively. The bands from top to bottom show the 
mixture of His-C-XX(Q/E)-3CLP-Y and His-C-XX-C145A-Y, the cleavage product 
|C145A-Y, and the cleavage product II His-C-XX. 


His-C-XX(Q/E)-3CLP- Y was used as the enzyme, and His-C- 
XX-C145A-Y was used as the substrate. Two control experi- 
ments were also carried out by incubating His-C-XX(Q/E)- 
3CLP-Y and His-C-XX-C145A-Y for 120 min, and the 
fluorescence spectra did not change. At the beginning of the 
reaction, a high efficiency FRET signal was detected, which 
decreased along with the digestion (Fig. 2), indicating that 
His-C-XX(Q/E)-3CLP-Y digested the substrate His-C-XX- 
C145A-Y well. Further supporting evidence was obtained from 
SDS-PAGE. In the beginning, only one band at 91 kDa was 
observed. After incubation for some time, the 91-kDa band 
became weaker. At the same time, two lower molecular mass 
bands appeared and turned thicker, indicating that His-C-XX- 
C145A-Y was digested into two parts (about 27 and 64 kDa). 
The monomeric polyprotein can still retain its enzymatic activ- 
ity in vitro. 

Properties of the Truncated Fusion Protein 3CLP- Y-His—The 
results shown above indicate that the 3CLP"® in the polyprotein 
appeared to be monomeric and was enzymatically active. We 
then wanted to know the property and activity of the N-termi- 
nal cleaved polyprotein. A truncated protein, 3CLP- Y-His, con- 
taining the 3CL?"°, the YFP protein, and a C-terminal His tag, 
was expressed and purified in vitro. The enzymatic activity was 
tested using His-C-XX-C145A-Y as substrate. 3CLP-Y-His 
digested the substrate His-C-XX-C145A-Y very rapidly. Within 
15 min, almost 95% of the substrate was digested (Fig. 3), which 
is much faster than the full polyprotein (Fig. 2). Gel filtration 
and analytic ultracentrifugation analysis showed that 3CLP- Y- 
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FIGURE 3. The enzymatic activity of 3CLP-Y-His in vitro. A, spectra of FRET 
monitoring of enzymatic activity of truncated fusion protein 3CLP-Y-His with 
different reaction times. B, samples from different reaction times were loaded 
onto SDS-PAGE. Lanes 1-4 show 0, 1, 15, and 30 min. The bands from top to 
bottom show His-C-XX-C145A-Y, a mixture of 3CLP-Y-His and cleavage prod- 
ucts |C145A-Y, and cleavage product II His-C-XX. 


His was also monomeric (Fig. 1, supplemental Figs. S3 and S4, 
and supplemental Tables S3 and S4). 

To eliminate the possible effect of the YFP protein in 3CLP- 
Y-His, we substituted it with MBP protein to construct 3CLP- 
MBP-His, which showed almost the same activity as 3CLP-Y- 
His. We concluded that after the first step of autocleavage, the 
product containing a free N terminus and one protein attached 
to the C terminus remained monomeric in solution and with 
higher enzyme activity than the polyprotein with proteins 
attached to both ends. This enzyme activity was not related to 
the proteins attached to the 3CLP termini. 

Inhibition Efficiency of an Isatin Inhibitor on His-Y- 
XX(Q/E)-3CLP-Y—The isatin derivative 5f is a non-covalent 
SARS coronavirus 3C-like protease inhibitor, with an IC., of 
0.37 um. We used 5f as a probe to test whether the substrate 
binding pocket conformation of 3CLP*° in the polyprotein 
remains the same as that in the free 3CL?"°. 5f can almost fully 
inhibit the enzymatic activity of His- Y-XX(Q/E)-3CLP-Y at 50 
LM (supplemental Fig. S5 and supplemental Table S5), implying 
that the substrate binding pocket conformation of 3CL?”® in the 
polyprotein is similar to that in the free 3CL?”®. 

Aggregation State of the Polyproteins with the Presence of Sub- 
strate or Substrate Analog—The experimental results above 
indicated that the polyproteins were monomeric in solution as 
free proteins. However, we still do not know whether they can 
form transient complexes in solution and what happens when 
they react with substrate. We did chemical cross-linking exper- 
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iments for the two polyproteins without and with the presence 
of a substrate analog, an isatin derivative 5f. Similar to the sub- 
strate, 5f was shown to be able to enhance dimer formation for 
SARS-3CLP"° in our previous study (24). From the cross-linking 
experiment, we can see that more dimers were formed for the 
wild type protein, the N-terminal cleaved protein, and the full- 
length large protein (Fig. 4). In fact, for the full-length large 
protein, the dimer band showed very weak, if any, formation, 
with EGS cross-linking in the absence of 5f, which turned out to 
be more obvious after cross-linking in the presence of 5f. 


A Bo = =C 
an Le | , mt 


FIGURE 4. SDS-PAGE of chemical cross-linking by EGS. A, wild type SARS- 
3CLP"° (5.12 mg/ml). B, 3CLP-Y-His (6 mg/ml). C, His-C-XX(Q/E)-3CLP-Y (6 
mg/ml). For each PAGE, from left to right, lane 1 is the enzyme only, lane 2 is 
enzyme with EGS, lane 3 is enzyme with equal molar substrate analog 5f, and 
lane 4 is enzyme with equal molar substrate analog 5f and EGS. 
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XX(Q/E)-3CLP-Y without and with the presence of isatin derivative 5f at 
40,000 rpm and 20 °C. 


Due to the experimental condition requirement for the 
cross-linking study, the protein concentration used is quite 
high. As such high concentration may not reflect the real situ- 
ation in vivo, we then analyzed the aggregation state of the 
polyprotein His-C-XX(Q/E)-3CLP-Y at a rational concentra- 
tion (1 mg/ml) in the presence of 5f (25) at 10 wm using AUC. 
From the AUC results (Fig. 5), it is clear that the polyprotein is 
monomeric without 5f and partly forms dimer upon the addi- 
tion of 5f. Thus with the presence of substrate, the polyproteins 
can also form dimers. 

Dependence of Enzyme Activity on Enzyme Concentration— 
As shown in our previous studies (4, 24), the enzyme concen- 
tration dependence of SARS-3CL?"° implies a dimer-only activ- 
ity control mechanism. For the polyproteins, we also measured 
their apparent k..,,/K,,, for pNA peptide substrate at different 
enzyme concentrations. The (K,,./K,,,)app increases along with 
the enzyme concentration increase in a similar way as the 
matured enzyme (Fig. 6). We also tested the concentration-de- 
pendent enzyme activity using the polyprotein His-C-XX- 
C145A-Y as substrate and observed a similar trend (data not 
shown). Together with the AUC analysis, we conclude that 
dimerization is also required for the polyprotein, although the 
dimer may only form transiently, which can be stabilized by 
substrate. We fitted the enzyme activity data using Equation 6 
to derive (k..,,/K,,, dimer and association constant K, (Table 1). 

From Table 1, we can see that among the three proteins con- 
taining 3CL?”®, the intact polyprotein has the lowest ability to 
form active dimer, with a dimer (substrate-induced) dissocia- 
tion constant of about 100 uM, whereas the N-terminal cleaved 
polyprotein has a somewhat stronger dimer formation ability 
(K,, around 42 um) and the free protein has the strongest dimer 
formation ability (K, around 1.6 pm). The catalytic activities of 
the dimers formed by the three proteins also follow the same 
order. As the apparent enzyme activity positively depends on 
the dimer activity and the dimer association constant, the intact 
polyprotein has quite low enzyme activity, and upon the first 
step of digestion, the activity of the truncated 3CLP- Y-His will 
be increased, and the matured 3CLP*® will have the highest 
enzyme activity. Although different substrates may give differ- 
ent quantitative results, the activities should follow the same 
order. 
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FIGURE 6. Enzyme activity of SARS-3CLP"°, 3CLP-YFP-His, and His-C-XX(Q/E)-3CLP-Y at different concentrations. The apparent velocity second-order 


constant (k.,,/K,,) 


app increases in the same manner as matured SARS-3CL°" as the enzyme concentration (Conc E) increases, indicating that the dimer form is 


the active form of the proteinase. ll, measured value of (k_¢/K,,) app: the solid line is the fitted curve. 
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TABLE 1 


Relative enzyme activities toward the pNA peptide substrate 


The proteolysis activity of all the enzymes at different concentrations were 
determined by incubating 0.2 mm pNA peptide with the enzymes in buffer A (40 
mM phosphate-buffered saline, pH 7.3, 100 mm NaCl, 1 mm EDTA) with 5 mm 
dithiothreitol. 


Enzyme Caer | Ge prea K,? K,? 

mu ‘+ min! pm" [LM 

His-C-XX(Q/E)-3CLP-Y 3.6 + 0.4 0.010 = 0.002 100 
3CLP-Y-His 6.4 + 0.6 0.024 + 0.004. 42, 
os) Oe 5L8 212 0.61 + 0.08 1.6 


“ Fitted and determined by Equation 6. 
> K, was derived from K,,. 


DISCUSSION 


Substrate-induced Dimerization Is Essential for the Activity of 
SARS-3CL?”? in the Polyprotein—In vitro studies showed that 
dimerization is essential for the enzyme activity of SARS- 
3CLP*° (4). However, whether dimerization is necessary for the 
activity of its precursor, the polyprotein, is still not clear. Hsu 
et al. (22) reported that the partially cleaved polyprotein can 
form a small amount of active dimer and proposed that during 
the autocleavage of the 3CLP"®, the enzyme is dimeric at each 
step. However, in their study, only 10 residues were attached to 
both the termini of SARS-3CLP"°, which may not be long 
enough to mimic the situation in the polyprotein as the hydro- 
phobic regions 1 and 2 (HD1 and HD 2) attached to the N and C 
terminus of 3CLP*° are more than 10 amino acids in the repli- 
case of coronavirus (26). 

In the current study, we used full-length folded proteins as an 
attachment to the two ends of 3CLP*° to mimic the case in the 
replicase. Our results showed that 3CL?*° in the polyprotein 
was monomeric in solution when standing alone, and the activ- 
ity was not related to the proteins attached as the polyprotein 
kept its activity when substituting YFP with MBP. Furthermore, 
we also discovered that the N-terminal cleaved product of the 
polyprotein, which occurs when only one protein is attached to 
the C terminus of 3CL?"®, was still active. These active polypro- 
teins can also be deactivated by the previously reported 3CLP"° 
inhibitor 5f, implying that the active site conformation in the 
polyprotein may be similar to that in the active protomer of the 
aw LP’ Gimer, 

Substrate-enhanced dimerization was observed for the 
matured SARS-3CLP"’® (24). The dissociation constant of pure 
SARS-3CLP*° was measured to be 14 uM using sedimentation 
equilibrium. However, for most in vitro assays for SARS- 
3CLP"°, the enzyme concentration was around 1 pM. As our 
previous study (4) and various other studies (19) have shown, 
only dimer is active for this enzyme. This brought a dilemma as 
the dimer concentration must be very low under the assay con- 
dition assuming a K, of 14 um. However, when substrate was 
present, the dimerization ability of SARS-3CLP"° was signifi- 
cantly increased with a K, around 1 uM under the experimental 
conditions (24). 

For the case of the polyproteins, we also tested their enzyme 
activity change along with the increase of enzyme concentra- 
tion. Similar to SARS-3CLP"°, the apparent second-order con- 
stant, (K.a:/K,,)app» increases with the enzyme concentration 
(Fig. 6). Thus we postulated that the polyprotein also needs to 
dimerize first to perform its enzymatic function. 


ASBIVIB 
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To verify this assumption, we also did EGS cross-linking and 
AUC analysis for the polyproteins with a substrate analog com- 
pound 5f. we found that significant dimer can be detected when 
both experiments were done in the presence of 5f. Thus we 
concluded that substrate-induced dimerization is essential for 
the activity of SARS-3CLP"® in the polyprotein. 

Possible Mechanism for the Cleavage Reaction of SARS- 
3CL?” in the Polyprotein—Although few autolytic studies have 
been reported for SARS 3CLP", 3CLP*® from other coronavi- 
ruses were studied in more detail. It was reported that the auto- 
processing of 3CLP*® was related to membrane (27-29), 
although the details of dynamic control during this process are 
still not clear. Our results indicate that although the polypro- 
tein remains a monomer as free protein, it can form transient 
dimers upon substrate binding to perform their catalytic 
activity. 

Hsu et al. (22) proposed a possible mechanism for the matu- 
ration process of SARS-3CLP*® from their crystal structure 
study of the C145A mutant. As the active site of one protomer 
in the C145A dimer binds with the C-terminal 6 amino acids of 
the protomer from another asymmetric unit, they believe that 
this kind of structure mimics the product-bound form in the 
maturation process. They also attached 10 extra residues at the 
N terminus and/or the C terminus of the protein and found that 
these proteins also form dimers, although weaker when com- 
pared with the matured enzyme. In their mechanism model, for 
the first and second step, two monomers of the polyprotein first 
cleave each other to cut off the N-terminal extra peptides and 
then form a dimer. That is, the first step of the reaction occurs 
between the two monomers within one dimer. 

However, our data showed that the polyproteins with extra 
proteins at both ends can still form dimeric structure under 
substrate induction condition, which prompts for an intermol- 
ecule reaction mechanism. For the case of HIV protease matu- 
ration, during the first step of digestion, the full-length HIV 
polyprotein also forms dimer by prerequisite and cuts the 
N-flanking site intramolecularly as the initial rates related to 
N-terminal autocleavage linearly increased along with the 
increase of the polyprotein concentration (30). In contrast, our 
results indicated that the initial rates for the SARS polyprotein 
related to the N-terminal or C-terminal autocleavage non-lin- 
early increased along with the increase of the polyprotein con- 
centration. That is, the SARS polyprotein may cut its termini 
intermolecularly. In fact, this is also in accordance with the 
crystal structure of Hsu et al. (22) as they observed that a dimer 
molecule of the C145A mutant binds with the C-terminal 6 
amino acids of the protomer from another asymmetric unit. 

Based on the above analysis, we proposed an updated version 
for the SARS-3CLP"® maturation mechanism (Fig. 7). Shortly 
after the polyprotein translation, a tiny amount of transient 
dimers can form, which become more stabilized by binding to 
its substrate (another polyprotein), and then an intermolecular 
reaction occurs to release the free N terminus of SARS-3CLP*° 
in the polyprotein. The one-step product, with a free N termi- 
nus but a restricted C terminus of 3CL?"®, can be induced into 
the dimer form more easily by the substrate and is more active 
than its precursor, which will make it act as the main enzyme to 
cut the N-terminal flanking site of the other molecules in the 
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FIGURE 7. Possible mechanism of 3CLP"° autoprocessing. In the first step, 
polyproteins A and B form a transient dimer, which cuts the N-terminal flank- 
ing site of molecule C (and D, etc.). In the second step, molecule C and D with 
a free N terminus but a restricted C terminus of 3CL°"° form a dimer and act as 
the main enzyme to cut molecule A or B through trans-cleavage. For the third 
step, the truncated polyprotein with free N terminus digested its C-terminal 
flanking site to release the free C terminus of 3CLP'° monomer. For the fourth 
step, monomers of the matured enzyme assembled into an active dimer of 
aC. 


Fourth step 


polyprotein in step two. In the third step, the one-step product 
digests its C-terminal flanking site to release the free C termi- 
nus of 3CLP*° monomer. In the fourth step, along with the accu- 
mulation of 3CLP*® monomers, active dimers were assembled 
by two protomers to form matured 3CLP". 

As the substrate-induced dimerization occurs at each step 
when compared with the conventional “power of two” mecha- 
nism (10), this “substrate-dependent power of two” mechanism 
provides an additional way of enzyme activity regulation. For 
example, at the very beginning of viral replication, only a very 
low concentration of the enzyme and the substrate is present, 
so nothing will happen. Along with the polyprotein synthesis, 
substrates are accumulated, which then induce more enzyme 
dimer formation and activate the process of the enzyme diges- 
tion. Then along with the digestion reaction, the concentration 
of substrate will be lowered, whereas the matured enzyme con- 
centration will be increased. As the matured enzyme has a 
stronger dimer association constant, its increased concentra- 
tion results in more active dimers to treat the low concentration 
of substrate until most substrates are consumed. This kind of 
enzyme activity regulation is a highly efficient and economic 
way of activity control and substance usage. We expect that in 
addition to other 3CLP"® proteinases, more examples of mul- 
tidomain proteins may also follow this type of dynamic control 
mechanism. 
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